Detecting Accounting Frauds in Publicly Traded U.S. Firms: A Machine Learning Approach

نویسندگان

  • Bin Li
  • Julia Yu
  • Jie Zhang
  • Bin Ke
چکیده

This paper studies how machine learning techniques can facilitate the detection of accounting fraud in publicly traded US firms. Existing studies often mimic human experts and employ the financial or nonfinancial ratios as the features for their systems. We depart from these studies by adopting raw accounting variables, which are directly available from a firm’s financial statement and thereby can be easily applied to new firms at low cost. Further, we collected the most complete fraud dataset of US publicly traded firms and labeled the fraud and non-fraud firm-years. One key issue of the dataset is that the data is extremely imbalanced, in which the fraud firm-years are often less than one percent. Without re-sampling the data, we further propose to tackle the imbalance issue by adopting the techniques of imbalanced learning. In particular, we employ the linear and nonlinear Biased Penalty Support Vector Machine and the Ensemble Methods, both of which have been proved to successfully handle the imbalance issue in the machine learning literatures. We finally evaluate our approach by conducting extensive empirical studies. Empirical results show that the proposed schema can achieve much better performance, in terms of balanced accuracy, than the state of the art. Besides the performance, our approaches can also compute very fast, which further supports their practical deployment.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Financial Reporting Fraud Detection: An Analysis of Data Mining Algorithms

In the last decade, high profile financial frauds committed by large companies in both developed and developing countries were discovered and reported. This study compares the performance of five popular statistical and machine learning models in detecting financial statement fraud. The research objects are companies which experienced both fraudulent and non-fraudulent financial statements betw...

متن کامل

Fraud Magazine

Financial managers, auditors, and fraud examiners are eager to find an automated, timely approach to discovering which publicly traded companies may be harboring financial frauds. Investors, corporate treasurers, and portfolio managers all want a dependable, automatic process to help them sift through voluminous financial data to find the few firms among hundreds that give investors bad informa...

متن کامل

Volatility and Dispersion in Business Growth Rates: Publicly Traded versus Privately Held Firms

We study the variability of business growth rates in the U.S. private sector from 1976 onwards. To carry out our study, we exploit the recently developed Longitudinal Business Database (LBD), which contains annual observations on employment and payroll for all U.S. businesses. Our central finding is a large secular decline in the cross sectional dispersion of firm growth rates and in the averag...

متن کامل

Why Are U.S. Firms Holding So Much Cash? An Exploration of Cross-Sectional Variation

C urrently U.S. corporations have record-high cash holdings. Many argue that this phenomenon is related to the sluggish recovery of the economy: Firms holding more cash are investing less, and this prevents the economy from taking off. While referring to the cash holdings of Apple, the president of a business association stated “Why wasn’t Apple spending that money on expansion, new products an...

متن کامل

A hybrid model based on machine learning and genetic algorithm for detecting fraud in financial statements

Financial statement fraud has increasingly become a serious problem for business, government, and investors. In fact, this threatens the reliability of capital markets, corporate heads, and even the audit profession. Auditors in particular face their apparent inability to detect large-scale fraud, and there are various ways to identify this problem. In order to identify this problem, the majori...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015